Improved Coresets for Euclidean k-Means
In the most general setting, a coreset compresses the data set in such a way that, for any query from a previously specified set of candidate queries, the cost of evaluating the query on the original data set and the cost of evaluating it on the coreset are similar, up to an arbitrarily small distortion. A popular subject in the coreset literature is the Euclidean k-means problem.
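To make the coreset guarantee concrete for Euclidean k-means, the following minimal sketch measures the distortion for a single candidate query, where a query is taken to be a set of k centers and the cost is the weighted sum of squared distances to the nearest center. The helper names (`kmeans_cost`, `distortion`, `uniform_sample`) are hypothetical, and uniform sampling is used only as a naive stand-in: it is not the construction studied here and does not guarantee small distortion for every query.

```python
import random

def kmeans_cost(points, centers, weights=None):
    """Weighted sum of squared distances from each point to its nearest center."""
    if weights is None:
        weights = [1.0] * len(points)
    total = 0.0
    for p, w in zip(points, weights):
        nearest = min(sum((a - b) ** 2 for a, b in zip(p, c)) for c in centers)
        total += w * nearest
    return total

def distortion(points, coreset, coreset_weights, centers):
    """Relative error of the coreset cost versus the full-data cost for one query."""
    full = kmeans_cost(points, centers)
    approx = kmeans_cost(coreset, centers, coreset_weights)
    return abs(approx - full) / full

def uniform_sample(points, m, rng):
    """Naive stand-in for a coreset (assumption): m uniform samples, each weighted n/m."""
    n = len(points)
    sample = [points[rng.randrange(n)] for _ in range(m)]
    return sample, [n / m] * m

rng = random.Random(0)
points = [(rng.gauss(0, 1), rng.gauss(0, 1)) for _ in range(2000)]
centers = [(-1.0, 0.0), (1.0, 0.0)]  # one candidate query: a set of k = 2 centers
sample, weights = uniform_sample(points, 200, rng)
print(distortion(points, sample, weights, centers))
```

A true coreset would have to keep this distortion below a chosen threshold simultaneously for every candidate set of k centers, not only for the single query checked above.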